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Presently, flight systems designed to perform payload-centric maneuvers require pre- 
constructed procedures and special hand-tuned guidance modes. To enable intelligent 
maneuvering via strong coupling between the goals of payload-directed flight and the 
autopilot functions, there exists a need to rethink traditional autopilot design and function. 
Research into payload directed flight examines sensor and payload-centric autopilot modes, 
architectures, and algorithms that provide layers of intelligent guidance, navigation and 
control for flight vehicles to achieve mission goals related to the payload sensors, taking into 
account various constraints such as the performance limitations of the aircraft, target 
tracking and estimation, obstacle avoidance, and constraint satisfaction. Payload directed 
flight requires a methodology for accurate trajectory planning that lets the system anticipate 
expected return from a suite of onboard sensors. This paper presents an extension to the 
existing techniques used in the literature to quickly and accurately plan flight trajectories 
that predict and optimize the expected return of onboard payload sensors. 


1. Introduction 

P resently, flight systems designed to perform payload-centric maneuvers require pre -constructed procedures and 
special hand-tuned guidance modes. To enable intelligent maneuvering via strong coupling between the goals 
of payload-directed flight and the autopilot functions, there exists a need to rethink traditional autopilot design and 
function. Research into payload directed flight (PDF) examines sensor and payload-centric autopilot modes, 
architectures, and algorithms that provide layers of intelligent guidance, navigation and control for flight vehicles to 
achieve mission goals related to the payload sensors, taking into account various constraints such as the performance 
limitations of the aircraft, target tracking and estimation, obstacle avoidance, and constraint satisfaction. 

A central problem to address in payload directed flight is to control a known and controllable plant interacting 
with an external system based on payload and sensor data feedback that gives partial observation and understanding 
of the external system, to satisfy mission objectives and constraints on the combined system. This research focuses 
on trajectory generation and flight control under varying constraints in a highly dynamic environment, autonomous 
feature detection and estimation, and modeless autopilot design concepts for multi -objective system control. 
Application of this research is targeted towards increasing capabilities, performance, and efficiency in the execution 
of missions that require payload-directed and target-directed maneuvering. 

This central problem is shown conceptually in the block diagram in Figure 1, where a controllable system is 
coupled with an external system which may be unmodeled or poorly modeled for various reasons. These reasons 
may include complexity, uncertainty, lack of observability from sensor to state, the size of the external system’s 
state may overwhelm computational and modeling resources, or lack of available data to generate a model. A suite 
of sensors provide some set of observations into the system, and a set of mission objectives are defined concerning 
the combined system. The PDF research objectives seek methods, tools, and techniques for designing controllers 
around these blocks to ensure the combined system meets mission objective under varying constraints. 
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Figure 1. Payload Directed Flight Problem 

PDF mission concepts require simultaneous payload data loop closure at multiple levels in the control system 
hierarchy, from high-level cognitive deliberative decision making down to low-level reactive continuous -time 
control. The PDF architecture defines three distinct layers of loop closure: the outer layer, which comprises of the 
mission planner and scheduler, the middle layer, which comprises optimal trajectory calculations, and the inner 
layer, where filtered sensor data cuts directly into the autopilot system, providing for instance attitude command 
queues. The PDF architecture middle layer problem has two main constituents: computing optimal trajectories, and 
incorporating knowledge representation of the observed system. Optimal trajectory calculations play a key role in 
the concepts of Payload Directed Flight, particularly in the middle layer PDF architecture, that concerns planning 
behaviors on the time-scale of minutes. 


A. Payload Missions Requiring Middle Layer Planning 

The requirements for loop closure at the middle PDF layer require the onboard systems be able to compute 
trajectories under complex constraints and objectives. For instance, automated formation flying aircraft need to be 
able to represent complex phenomena - other aircraft, wing tip vortices, weather patterns - in calculating trajectories 
that will allow an aircraft to join and follow a formation, while the lead aircraft needs to be able to compute 
trajectories that provide the optimal paths for the convoy. A large number of possible applications for payload 
directed flight are outlined in Ref. 1 . 

A more complex example is shown in Figure 2. Consider an aircraft whose mission is to locate, identify, and 
monitor a hazardous smoke plume, utilizing onboard body fixed imagers. At the lowest level of control, direct 
control of the actuators can be utilized to center the image in the view frame. At the middle layer, sensor data is 
used to update an online environment model, and trajectories are planned to field the sensors at locations that 
maximize expected data return while navigating safely around the hazardous phenomena. At a higher level, mission 
objectives can be planned, scheduled, and prioritized based on what is being observed from the sensors. 
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Figure 2. Mid Level Loop Closure in Hazardous Sensing Missions 





The airborne earth science community regularly field missions that require flight planning to investigate large 
scale highly complex phenomena, fielding onboard payload sensor suites from theoretical predictive models. 
Currently, human intuition and judgment is required to interpret the data and develop flight plans to maximize the 
expected data return. The 2008 Arctic Research of the Composition of the Troposphere from Aircraft and Satellites 
(ARCTAS) mission featured three aircraft - a DC-8, P-3, and B-200 - fielding highly sophisticated scientific 
payloads to investigate a variety of scientific themes, including long-range transport of pollution to the arctic. 
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Figure 3. ARCTAS Flight Planning and Execution Model 


The authors observed the planning model in Figure 3 during the 2008 ARCTAS deployment fielded out of 
Fairbanks, Alaska, which was based on earlier INTEX-B 2 3 flight planning process. Participants in this mission 
included project managers and principle investigators who oversaw operation of the mission, platform operators and 
flight crew who operated the airborne platforms, instrument operators and experimentalists who operated sensor 
equipment and interpreted data in real-time to provide to the managers during flight, and theoretical investigators 
and modelers who produced data products from chemical transport models and general circulation models (CTMs 
and GCMs) for chemical and meteorological predictions. The flight plans also required aircraft underflights of 
satellites for ground truthing and validation. Satellite and model science teams were involved at all stages of pre- 
mission flight planning, flight execution, and post-mission data interpretation. Flight planning relied heavily on 
CTM/GCM simulation, analysis of satellite observations, meteorological and chemical forecasts from several 
modeling teams, analyses of near real-time satellite data and satellite validation needs, inputs from the aircraft 
science teams, and reviews of progress towards meeting mission objectives. The flight planning process required 
planners to estimate sensor return expected from the flight paths based on the various data products. A PDF middle 
layer planner could integrate into this model as a first step for validation of these algorithms, providing suggested 
plans to the project managers and principle investigator teams both in the pre-planning phase and during flight 
testing as shown in Figure 4. 



Figure 4. Conceptual Integration for PDF Technology 


B. Related Approaches 

There are many examples in the UAV literature of closing the loop around payload sensors. In Ref. 4 and 5, 
vision systems are used to extract road centerlines. Here the road identification process is achieved through image 
processing that exploits the linear or locally linear nature of roadways. Once the road center line is extracted, the 
tracking problem is reduced to a simple path following problem. The traditional approach to this problem is to 
formulate an inner loop controller using linear control theory and design an outer loop controller that uses the 
desired path to specify the desired bank angle or lateral acceleration. Nonlinear control laws have been proposed for 
solving this problem 4 (similar control results are also found in Refs. 6-9). Additionally, similar research tracking 
rivers and shorelines is presented in Refs. 10 and 1 1. An approach for tracking a ground vehicle is presented in Ref. 
12. In Ref. 13, a UAV serves as a communication repeater in a larger communication network. To achieve this, the 
UAV orbits a radio or communication ground source maintaining a fixed signal-to-noise ratio (SNR) with that 
source. The problem is analogous to the contour following problem with in-situ sensors. Presented in Ref. 13 is a 
traditional PID based control law formulated around the SNR error between time steps. This formulation, as 
demonstrated through simulation, will cause the UAV to spiral towards the specified contour (specified SNR value) 
and remain there once it is acquired. However, implicit in the formulation of this control law is the assumption that 
the SNR field is monotonic. In many of the cases already presented, this assumption was not applicable and 
necessitated the development of behavioral approaches. 

The re-tasking problem for UAS’s has also been explored in the literature. In Ref. 14 a list of targets to service is 
treated as a Traveling Salesman Problem. The problem of incorporating vehicle dynamics is achieved by solving the 
Traveling Salesman Problem using heuristics from traditional combinatorial optimization and then alternating the 
paths between targets as linear paths and minimum Dubins paths 16 . Several competing approaches using Dubins 
minimum paths are also available in the literature 14 " 18 . 

In Ref. 19, the notion of planning based on sensor swath was explored. In this work a UAS has a downward 
facing camera with a field of view. The problem is to determine the optimal tour through the targets such that all 
targets are observed. Unlike the TSP problem where the UAS passes through all the targets, this is a case where it is 
only necessary for the targets to pass through the sensor’s field of view. To solve this problem the UAS was 
modeled as a Dubins car with a discrete set of inputs u operating over a finite At. The operational space for the UAS 
was then explored using this model and the learning A* algorithm operating with an admissible heuristic. A similar 
problem as this was also considered in Ref. 20. Here, however, the operational space is explored using a 
probabilistic planning approach based on the Rapidly-Exploring Random Root Tree algorithm 21 ' 22 . 

Some of the research in the vein of payload directed flight for fixed-winged vehicles is focused on searching for 
targets with an unknown location. One example in Ref. 23 is probabilistic in nature and involves selecting a search 
space, discretizing that search space as a grid of cell locations, applying a probability that a target is in a given cell, 
and identifying the optimal path (in a probabilistic sense) for identifying targets. The research in this field is 



currently focused on optimal searching with multiple UAVs and optimizing the target identification or mapping 
ability over many vehicles 24 . 

The approach developed in this paper will utilize a numerical technique from the path planning literature to solve 
problems posed in an optimal control formulation. The algorithm and approach described below extends previous 
algorithmic techniques to include highly accurate vehicle trajectory generation under a general set of dynamic, state, 
and input constraints. Our approaches differs from the state of the art by utilizing continuous-time feedback control 
systems designed specifically to solve the trajectory generation problem utilizing the most accurate vehicle models 
available, or to the level of fidelity desired. In essence, this approach favorably trades run-time complexity for 
offline control system design. As a consequence of this approach, input control actuation is a product of the 
trajectory generation process and only dynamically feasible trajectories will be generated, rather than searching over 
discretized subset of inputs and generating highly simplified, unverified trajectories. 


C. Payload Directed Flight Architecture 

Consider the overall PDF system architecture, shown in the figure below. 



The trajectory planning algorithms presented in this paper are focused on the mid-level loops, ignoring the mission 
planning and scheduling blocks (which are the focus of the outer layer), and abstracting the lower -level autopilot 
controller in the mner layer. The PDF middle layer architecture is shown in the figure below, grouped into three 
conceptual components. The knowledge system is responsible for maintaining knowledge and belief world, using 
sensor inputs, state information, apriori beliefs, environment models, etc. The trajectory planning and optimization 
system is responsible for plotting trajectories from one state to intermediate goal states given by the higher level 
mission planner. The trajectory sequence then is sent to the autopilot system. These three systems are cascaded to 
form the mid-level closed-loop system in the PDF architecture. 
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Figure 6. PDF Middle-Layer Architecture 
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2. Problem Formulation and Algorithm Outline 

Consider a vehicle system P whose dynamics are governed by a series of non-linear differential equations of the 
following form, where xe s Jl n , ue9T ye s 31 p . 

[y = hi,u,t^ 

The model is augmented for vehicle or payload constraints, equality or inequality constraints in the set 
G=(G e ,G 0, of the form 

G e 4% X, U , tj= 0 and Gj C, X, U, t <s 0 

Additionally, the system is subject to a set of boundary constraints B 4c, t ^<0 specified on the state variables of 
the system, and can be time dependent. Sensors and mission objectives can be modeled and input into this algorithm 
as integral cost functionals $(x,u,t). A final state cost L(Xf,tf) can also be added. The objective function can be 
specified as 

( \ 


J j =Li( r t f j-'Yj J <fs i i(,u,t'^t 


Given an initial state x 0 , and a subset of the final state Xj - , (where xcx is the goal space), find the continuous 

set of control inputs and the associated trajectories from x 0 to that minimize the objective function J, subject to 
constraints in G and B. 


A. General Approach 

Our approach to solution of this problem is as follows. 


1. 

2. 

3. 


Define the plant P, given by P \ 


\x = f4c,u,t 
\y = hit,u,t 


Define the state space subsets. Define the goal space x as a subset of x, and define the search 

* _ A 

space X as a subset of the state space that contains the goal space; i.e., XCXCX. 

Define the augmented plant P’ by incorporating the plant constraints G into the plant model. 


4. Design a closed loop system G’ - the branch trajectory controller system - that can control the 
adjoined plant P’ from an initial state x 0 ex, to a destination point in the goal space x^. E X , 
utilizing the controller U. 

5. Incorporate/implement G’ in the online trajectory search algorithm outlined below. 

The closed loop control system G’ is used to compute trajectories that represent branches in a search tree, similar 
to Ref. 22. Our modified Tree Search Algorithm is as follows. 


Function TREE-SEARCH ( x G , t 0 , x goa i, ' 
Inputs 

*0 

Start state 

to 

Start time 

Xgoai 

Goal state 

C T 

Constraint set 

T 

Search Tree 


1. TREE CLEAR ( T ) 

2. x root = TREE GET ROOT ( T ) 

// Try to connect the xO to the goal point with trajectoiy S’=(x(t),u(t), t) 

3. S’= GENERATEBRANCH ( G\ x root , * goal ) 

4. If CHECK TRAJECTORY AG AINST C ONSTRAINT ( S’, C T ) do 

5. S’ = TRIM TRAJECTORY (S’, C T ) 

End if 

6. If S’ is not empty, do 

7. TREEADD ( T, S’, NULL) 

End if 

//Main Loop 

8. While more time left to search, do 

// Generate branch to random point. 

9. * rad = RANDOM_POINT(* ) 

10. ** = GETCLOSESTPOINT ( T, * md ) 

11. S’= GENERATEBRANCH ( G’, x*. jrrnd) 

12. If CHECK_TRAJECTORY_AGAINST_CONSTRAINT ( S’, CT ) 

13. S’ = TRIM_TRAJECTORY(S’, CT) 

End if 

14. If S’ is empty, do 

1 5 . Next/Continue While 
End if 

16. TREE ADD ( T, S’, NULL) 

// Generate branch from end of previous branch to goal state 

17. = GET LEAF ( S’ ) 

18. S’ = GENERATE BRANCH ( G\ x\ x goa i ) 

19. If CHECK TRAJECTORY AGAINST CONSTRAINT ( S’, CT ) 

20. S’ = TRIM TRAJECT ORY (S’, CT) 

End if 

21. If S ’ is not empty 

22. TREE ADD ( T, S’, NULL) 

End if 

Next While 


Figure 7. TREE-SEARCH Algorithm 


Algorithm TRIMTRA JEC T ORY ( S’, C T ) 

23. Trim branch S’ to remove points of collision and imminent collision with C T 

24. Generate avoidance/recovery maneuver branch S* 

25. If avoidance/recovery maneuver branch collides 

26. Discard S’ and S* branches. 

Else 

27. Add S* to end of S’ 

End if 

Figure 8. TRIM TRA JECT ORY Algorithm 

B. Plume Pointing Problem Example 

The trajectory generation system was implemented in simulation and tested as a navigation solution for the plume - 
pointing problem outlined in Figure 2. The aerial vehicle starts from a ground position with no concept of where the 
smoke plume is. The vehicle is constrained to an approved flight area. The vehicle system must observe the plume 
through body-fixed imagers, develop an internal model of the plume, and compute trajectories safely around the 
plume - withm the approved flight area - that allows the vehicle to train the imagers on the plume in order to update 
its internal model. The simulation scenario was implemented in the Reflection Architecture, including a custom 
smoke particle generation rendering system, as shown in Figure 9. 



Figure 9. Simulation of a Smoke Plume Monitoring Experiment. 

Body-fixed camera looking out the right side of the vehicle, under the wing. Box represents area to avoid. 

This phase of testing is only concerned with the trajectory generation system and trajectory following controller. To 
that extent, real-time sensor feedback from the camera sensors has not yet been integrated. In this experiment, the 
area of the plume that the vehicle wishes to navigate is fixed, and the areas to avoid are also fixed. The aircraft must 
compute trajectories that avoid the smoke plume, the ground, and stay within the approved airspace. 

C. Trajectory Generation Problem 

The first part of the problem is to consider the constraints. Consider an aircraft system that must navigate from 
some initial state x and reach a goal position Pe91 3 . The aircraft must avoid obstacles while performing this 
maneuver, which are specified as a set of inequality trajectory constraints in C T . Additionally, the aircraft dynamics 
are limited: the ailerons can only be deployed in a limited range, and the elevator rates are limited. 
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Figure 10. A Simple Trajectory Calculation Exercise 


The aircraft is required to make this maneuver while maximizing fuel efficiency. We choose not to have the goal 
position incorporated into the cost function cp at the final time (terminal error), but rather this constraint will be 
handled by the search algorithm. We will use a quadratic form for the cost function, which was chosen for ease of 
implementation for later problems. Consider the following cost formulation, where the sensor pointing cost terms, 
Lds and L^, are defined in section D. 


J = \ f i T Aj x +u T BjU + L ds +L dr ^T ; B,= 


1 


10 


The trajectory problem is stated as follows. Find the continuous set of control inputs and the associated trajectories 
ffomxo to Xj that minimize the objective function J, subject to constraints C and S. 

1 . Plant Definition 

First, we must define the plant. This solution will use the dynamics of the Exploration Aerial Vehicle (EAV) at 
NASA Ames Research Center as the model. The EAV is a small unmanned aircraft designed for subscale testing of 
vehicle control systems. The linear lateral and longitudinal dynamics for the EAV at 25 m/s and 130m altitude were 
estimated through a series of flight tests. The system states and inputs are given by 




X = Vat 

The lateral dynamics for the aircraft were estimated to be 

x iat =VP &P Ar 
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The longitudinal mode were estimated to be the following. 


u = 


Vat 


U 


Ion 


U ,at = 


ail 




r dr . 


~ -1.0176 

0.0728 

-0.8887 

0.2918' 


' 0.3404 

0.5297' 

-16.7704 

-5.2969 

1.2992 

0 

; B,. t = 

55.2736 

4.6948 

13.7306 

-0.3138 -1.3475 

0 

5 LAT 

-1.4038 

-9.7679 

0 

1 

0 

0 


0 

0 



X Ion 

<£> 

<1 

<1 

is 

< 

ii 

T 

i U lon = 


I I 

k 

< 



" -1.409 - 

■0.040515 0 

0.096803' 



'-0.0699 

3.24 


-0.0281 

-4.0540 

1.2896 

0 



-0.8405 

-0.03057 

Aon ~ 

-0.0566 

-20.9832 

-1.6784 

0 

5 

^lon ~ 

-25.99 

-0.07424 


0 

0 

1 

0 



0 

0 


In order to incorporate position into this system, we augment the system with the state vector p=[x y z], where p is 
the position vector of the aircraft in a north/east/down coordinate system. We assume a local flat inertial non- 
rotating earth assumption, and augment the system with the kinematic relationship, where p,0e9t\ and \\j is the 
Euler yaw angle. 

■ t d _ 


y E z l 


’ dt 


p = Ov r 


d sin^ cos(Z> 

— y/ = — — 

dt cos# cos# 

L r 

Although this is written as a linear relationship, the O is a nonlinear function of the Euler angles and wind axis 
angles, given by 

O f , 0 , (//, a, P > R ba2ned f, 0, i// \, l2ba <*, /? J 0 o! 

The transformations from body axis (BA) to wind axis (WI) to world axis (NED) is given by 

X <^sXif. 

2. Space Definitions 

The dynamic/kinematic system is formulated with a state space vector xeW 12 and control input space ue'.H 4 , where 
the system model’s state and input vectors were given by 

x =t > V X la, X lonl ; u, at =%S ail A S rdr A S de A 8 thr 

Here, 

x ion = t v r Acr A q AG T _x lat =%P A p A r A<f) T _ 

This problem is formulated with a goal state given as the final desired Cartesian position, independent of all other 
states. The search space and goal space are defined as 


y E z l 


3. Augmented Plant 

Next we develop the augmented system P\ Our original plant P can be modeled in block diagram form as shown in 
Figure 1 1 . Constraints C P have been added to the input, and are shown with a filled background. 
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Figure 11. Vehicle Plant Model 


4. Branch Trajectory Controller 

The next step in our approach is to design a closed-loop controller/plant system G’ that can control the adjoined 
plant P’ from an initial state x 0 ex, to a destination point in the goal space X j G X . 

There are a few different autopilot system modes we can use to implement the trajectory computation algorithm. 
We will base this control problem on based on the existing heading -command autopilot system designed for the 
EAV. This autopilot system has the option of implementing either a "track-to" or "direct-to" autopilot mode. 




Figure 12. Two Candidate Autopilot Control Modes for Waypoint Following 

Track-To and Direct-To from initial state to desired goal position 


The ‘direct-to’ autopilot mode will guide the aircraft directly to the waypoint based on heading command from the 
current aircraft state. This autopilot design is a little simpler, as a single arctan evaluation is needed to compute the 
heading error at any point in time. An autopilot block diagram is shown below. Here, roll command is used in favor 
of roll-rate command for increased margins and implementation on the real EAV system. 



Figure 13. Direct-To Autopilot Design 

Each block shown in the figure above is a controller block which can vary in form; here we consider the classic 3- 
parameter PID form for the transfer function. 


Block . . = x + ^-+ K,s 
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Figure 14. Transfer function for Transform Blocks (Simple PID Block) 

The TRACK-TO autopilot system is similar in fonn to a DIRECT-TO autopilot system, except for the additional 
calculation of a cross-track error as an error signal to drive to zero. The block diagram is shown below. 



TRACK-TO Autopilot System 


Figure 15. TRACK-TO Autopilot Design 


The crosstrack error (XT eiT ) is used to calculate a delta heading angle (ij/ A ) which is the deviation from the 
nominal heading angle (v|/ nom ). This relationship is shown graphically below. Also, the line distance calculation 
block is also shown. 
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Figure 16. DetaOs of XTE Calculations 


Our exact vehicle system could be used for these computations, but effectively a heading control system will 
turn the complex 6-DOF vehicle system with controller into some sort of low-pass filter. The next step is to develop 
the autopilot and simulation test with this reduced model. The Matlab Simulink implementation of this controller is 
shown in Figure 19 below. The diagrams in Figure 20 shows the output of a test script, which generated a random 
path, and a set of random initial conditions for the aircraft. The resultant trajectories computed by the closed-loop 
system are shown. 


D. Sensor Models 

The implementation in Figure 19 contains a “Cost Model” block. This block contains the computations for compute 
the derivative of the cost function at any point in time and state of the vehicle system. Three different cost models 
are included in this test. The first cost model penalizes sensor pointing inaccuracy of the sensor towards a specified 
target, as well as pointing distance, as illustrated in Figure 17 below. 



Let a sensor axis frame Fs be defined as a fixed frame relative to the aircraft, with the y-axis pointing in the direction 
of the sensor. Let R^i represent the 3x3 transformation matrix from Fs to FI, where FI is the local axis system. Let 
Ri 2 W represent the rotation matrix from FI to Fw, where Fw is the world axis system (here we assume selection of an 


appropriate world axis system, such as a world-fixed east/north/up axis system). Let the position of the vehicle be 
given by P ve hicie ? and the position of the observed point be specified by P obs . Then the directional cost incurred from 
point inaccuracy, L&, is computed by the following equation. The resulting costs for random trajectories - when 
only Lcb directional costs are taken into account - are shown in Figure 21. 
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In addition to pointing accuracy, the sensors need to maintain a specific distance to the target. This is modeled as a 
curve that falls off with the square of the distance to the target position. Let R des be the desired distance from the 
sensor to the phenomena, and let Ra ev be the deviation distance, then the resulting equation is and graph is shown in 
Figure 18 below. The resulting costs for random trajectories - when only L ds distance costs are taken into account - 
are shown in Figure 2 1 . 



Figure 18. Cost Distance Model and Equation 

Shown for Rd es = 780 , Rdev=S 0 



Figure 19. Simulink Model for Trajectory Control System 
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Figure 20. Trajectory Response for Randomly Generated Points 
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Figure 21. Pointing Cost Derivative for Trajectories. 

Sensor is pointing to the right of the vehicle. 
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Figure 22. Distance Cost Derivative for Vehicle Trajectories. 


3. Simulation Results 

The controller was tested simulation using the Reflection Architecture, with the entire mission scenario (except for 
landing) being controlled by the trajectory planner. The Matlab Simulink model was converted into C++ code using 
the Real-Time Workshop Embedded Coder product from Mathworks. The tree-branching algorithm was allowed to 
plan for 5 seconds. The planning system looked ahead of the current time by 10 seconds, and repeated the planning 
algorithm every 20 seconds. The system was tested on several systems with the Tenderer, visualizations, and full 
UAV simulation being run on the same computer. The test was run on an Intel Core 2 X9650, 3 GHZ, 3GB RAM. 
The screenshots shown in Figure 23 shows a typical screenshot of the entire system running, with the visualization 
rendering 500 branches, and associated costs shown in color: red represents higher costs, blue represents lower 
costs. The trajectory planner immediately guides the aircraft to the optimal track, which is a right hand turn to allow 
sensor monitoring of the plume, at the distance Rd ev from the target position. Ail trajectories found avoid the 
plume’s bounding volume and guides the aircraft safely inside the approved fly zone. To reliability of the system in 
real-time, the flight management system can be disengaged and the aircraft can be hand piloted away from the 
plume and commanded to a hazardous orientation; as soon as the autopilot and planner is reengaged, the trajectory 
planner immediately computes the appropriate control inputs and trajectories to recover from the unusual orientation 
and guides the aircraft safely back to the optimal track. For a test lasting 30 minutes, the search algorithm averaged 
48,202 nodes (states) added to the tree, with 3435 branches, taking 15,002,030,502 CPU clock cycles. The 
trajectory planner was running in its own thread and had frill utilization of one of the processors. 



Figure 23. Screenshot of Planner and Branches. 

Branches are shown as red/blue lines, where red represents higher costs and blue represents lower costs. The green 

line represents the trajectory selected. 
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Figure 24. Takeoff Trajectories (Overhead shot shown on right). 

Costs build up fast because of the distance to the plume , but the selected trajectoiy immediately takes the vehicle to 

the ‘optimal ’ path for sensor pointing. 

4. Conclusion and Future Work 

The results of the simulation were promising. The vehicle was able to successfully navigate the area and keep 
the smoke plume in view 100% of the time after the aircraft establishes its position in the ‘optimal’ track. Constraint 
satisfaction was maintained for 100% of the time after takeoff, and constraints (plume collision volume, ground 
volume, and approved airspace volume) were never violated. 

As follow up work on this project, this simulation will be extended to complete the mission objectives 
completely autonomously. 

1 . Integrate the camera sensor processing module and environment knowledge database to close the loop fully 
around this scenario, where the plume position is not known, and the mission is executed fully 
autonomously. 

2. Integrate a higher fidelity non-linear 6-DOF model of our UAV for trajectory planning. 

3. Develop more accurate tracking with a linearized trajectory controller to follow the trajectories. \ 

4. Extend the trajectory generation algorithm to include swappable cost modules for different sensors, and 
multiple sensors. 

5. Extend the generation algorithm to select from several different closed-loop control systems, based on a 
heuristic. 

6. Perform a point-sampling maneuver into the ‘less-dense’ areas of the plume, and a ground sensing 
maneuver over a specified target area. 

7. Flight test these algorithms at NASA Ames Research Center/Moffett Airfield on the X-SCAV 
(experimental Sensor Controlled Aerial Vehicle) plug-and-play UAV (see Figure 25). 
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